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SCALABLE SELF-ROUTING SUPERCONDUCTOR SWITCH 

FIELD OF THE INVENTION 
[0001] The present invention relates to switches, and more particularly to a 
scalable self-routing superconductor switch. 

BACKGROUND OF THE INVENTION 
[0002] Superconductor switches are capable of very high data rates with low 
power dissipation. Crossbar switches that are implemented using superconductor 
digital electronics are capable of high data rates per line and are reconfigurable 
within a few clock cycles. The crossbar switches also have a compact size of a few 
cubic centimeters and low on-chip power dissipation. Network routers and multi- 
processor computers are a primary application of crossbar switches. For these 
applications, the crossbar switches must be self-routing. Using conventional 
crossbar switches in routers involves unacceptable overhead for the select logic. 
Conventional crossbar switches also need a parallel decoder circuit and an 
interconnect between the decoders and the crossbar switch. The row decoder also 
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has one output connection for each column. In other words, each cross-point would 
need to be connected to its own decoder output. 

[0003] While having high data rates, prior art superconductor switches have 
made compromises between data rate, power, scalability, total throughput, and 
reconfiguration time. Robert Sandell, John Spargo, and Michael Leung, "High Data 
Rate Switch With Amplifier Chip", IEEE Trans. Appl. Supercond., vol. 9, pp. 2985- 
2988, (June 1999) discloses a switch that operates at 2.5 Gbps and performs self- 
routing. However, the switch has high decoder and interconnect overhead. 
Scalability and power dissipation can also be improved. 

M [0004] Qing Ke, Bruce Dalrymple, Dale Durand, and John Spargo, "Single 

Q 

□ Flux Quantum Crossbar Switch", IEEE Trans. Appl. Supercond. vol. 7, pp. 2968- 

ry 

2 2971 , (June 1997) discloses a switch that employs rapid single flux quantum (RSFQ) 

% logic that enables ultra-high data rates but requires a parallel decoder. In addition, 

l A scalability and power dissipation are unacceptable. 

fU 

y A [0005] L. Wittie, D. Zinoviev, G. Sazaklis, and K. Likharev, "CNET: Design of 

tD 

□ an RSFQ switching network for petaflops-scale computing," IEEE Trans. Appl. 

jjss& 

Supercond., vol. 9, pp. 4034-4039, (June 1999) discloses a switch with a multi-stage 
Batcher-Baynan architecture that is implemented using RSFQ logic. Unfortunately, 
the switch has relatively high stage and interconnect complexity. Furthermore, the 
design has not been implemented. 

[0006] S. Yorozu, Y. Hashimoto, H. Numata, S. Nagasawa, and S. Tahara, 
"System demonstration of a superconducting communication system," IEEE Trans. 
Appl. Supercond., vol. 9, pp. 2975-2980, June 1999 discloses a switch that solves 
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the interconnect problem by using a ring architecture. Potential throughput, 
however, does not compare favorably to crossbar switches. 

[0007] In RSFQ logic, information is stored in superconductor loops as tiny 
magnetic flux quanta and a bit is transferred as several picosecond-wide voltage 
spike with a quantized area of approximately 2.07mV ps. The tiny and quantized 
nature of magnetic flux quanta significantly (by several orders of magnitude) reduces 
crosstalk and power consumption as compared to CMOS devices. 
[0008] The RSFQ circuit can be considered as consisting of elementary cells 
or timed gates. Each cell has two or more stable flux states. The cell is fed by SFQ 
u input pulses Si, S 2 , ... Si that can arrive from one or more signal lines and a clock 

b 

C3 timing line T. Each clock pulse marks a boundary between two adjacent clock 

m 

5j periods by setting the cell into its initial state. During the new period, an SFQ pulse 

% can arrive or not arrive at each of the cell inputs Sj. Arrival of the SFQ pulse at a 

terminal Si during the current clock period defines the logic value 1 of the signal Si 

hi 

y, while the absence of the pulse during this period defines the logic value 0 of this 

O signal. 

[0009] RSFQ circuits do not require the exact coincidence of SFQ pulses in 
time nor is a specified time sequence of the various input signals needed. Each 
input pulse can either change or not change the internal state of the cell. Input 
pulses cannot produce an immediate reaction at the output terminal(s) S ou t- Only the 
clock pulse T is able to fire out the pulse(s) S ou t corresponding to the internal state of 
the cell predetermined by the input signal pulses that have arrived during the clock 
period. The same clock pulse terminates the clock period by resetting the cell into 
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its initial state. An elementary cell of the RSFQ family is approximately equivalent to 
a typical asynchronous logic circuit coupled with a latch (flip-flop) that stores its 
output bit(s) until the end of the clock period. 

SUMMARY OF THE INVENTION 
[0010] A crossbar switch according to the invention includes a crossbar matrix 
with n input rows of cross-points and m output columns of cross-points. The 
crossbar switch further includes n decoders that are connected to the n input rows. 
The n th row includes a single serial address input connected to the n th decoder. 
[0011] In other features of the invention, each of the n rows includes a single 
shift input and a single data input. A serial address and data enters the address 
input and the data input in parallel. A shift sequence is transmitted on the shift 
input. The data flows before the shift sequence on the shift input is complete. 
[0012] In other features of the invention, the data is shifted through the 
crossbar switch using a clock that is generated on-chip using a clock recovery 
circuit. The decoder converts an N-bit binary address input into a serial address. 
The decoder includes an N-bit counter with N divide-by-2 toggle flip-flops. 
[0013] In still other features of the invention, the crossbar switch is 
implemented using superconductor digital electronics. In a preferred embodiment, 
the crossbar switch is implemented using rapid single flux quantum (RSFQ) logic. 
[0014] Further areas of applicability of the present invention will become 
apparent from the detailed description provided hereinafter. It should be understood 
that the detailed description and specific examples, while indicating the preferred 
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embodiment of the invention, are intended for purposes of illustration only and are 
not intended to limit the scope of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 





[0015] 


The present invention will become more fully understood from the 




detailed description and the accompanying drawings, wherein: 




[0016] 


FIG. 1 illustrates a crossbar switch according to the prior art; 




[0017] 


FIG. 2 illustrates the connection of the decoders to the cross-points in 




the crossbar switch of FIG. 1 ; 
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[0018] 


FIG. 3 illustrates a crossbar switch according to the present invention; 
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[0019] 


FIG. 4 illustrates the cross-points for a row of the crossbar switch in 


i.i 


FIG. 3; 
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[0020] 


FIG. 5 illustrates control and data signals for the crossbar switch of 




FIG. 3; 
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[0021] 


FIG. 6 illustrates the decoder for the crossbar switch of FIG. 3 in more 
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detail; 
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[0022] 


FIG. 7 is a schematic block diagram illustrating a preferred crosspoint; 




[0023] 


FIG. 8 illustrates Mealy state-transition diagrams for the non- 



destructive readout (NDRO), inverter (NOT), and destructive readout (DRO) circuits 
of FIG. 7; and 

[0024] FIG. 9 is a schematic of a NDRO gate showing a S ou t tap. 



5 



• # 

TRW Docket No. 12-1224 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0025] The following description of the preferred embodiment(s) is merely 
exemplary in nature and is in no way intended to limit the invention, its application, 
or uses. 

[0026] The crossbar switch according to the present invention is preferably 
implemented using superconductor digital electronics. The crossbar switch is self- 
routing and is ideally suited for network routers and multi-processor computers. The 
crossbar switch is capable of 40 Gbps, 100 Gbps or greater data rates per input line. 
The crossbar switch has a reconfiguration time of a few clock cycles, a potential 
i,i switch size of 256 x 256, 1024 x 1024, or greater and a compact overall size of a few 

jF=J% 

P cubic centimeters. The crossbar switch has relatively low power dissipation of 

rU 

CO 0.1 to 1 Watt. 

O 

1: [0027] More particularly, the crossbar switch according to the present 

\£ 

L f= invention preferably provides a scalable self-routing switch using rapid single flux 

fU 

La, quantum (RSFQ) logic, which is a standard logic family. The crossbar switch 

\Q 

C3 includes the use of a single serial select input per row. The crossbar switch includes 

U 

select logic that allows the serial address and the data to enter the crossbar in 
parallel, which reduces configuration time to the absolute minimum amount of time 
(in other words, the amount of time that is required to enter the binary address bits 
and not more). The decoder logic converts the binary address input into a serial 
address. An on-chip clock is preferably generated using a clock recovery circuit. 
[0028] Referring now to FIGs. 1 and 2, a crossbar switch 10 according to the 
prior art is illustrated. The crossbar switch 10 includes a plurality of inputs 12-1, 12- 
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2, 12-3, 12-n that correspond to the n rows. The n inputs 12 are connected to n 
decoders 14-1, 14-2, 14-3, 14-n. Outputs of the decoders 14 are connected to 
cross-points 16-11, 16-12, 16-13, 16-nm that are associated with the n rows and 
with m output columns 18-1, 18-2, 18-3, 18-m. 

[0029] In use, serial data including an address header and the data payload 
enter the crossbar switch 10 on one of the serial inputs 12. Each of the decoders 14 
operate independently. One of the decoders 14 (that is associated with the serial 
input 12 receiving the serial data) strips off the address header, selects one of the 
output columns 18, and passes the data to the selected output column 18. 
[0030] Referring now to FIG. 2, the crossbar switch 10 of FIG. 1 is illustrated 
in further detail. Each decoder 14 includes a separate select input 20-1 1, 20-12, 20- 
13, 20-nm for each cross-point 16-11, 16-12, 16-13, 16-nm. As can be 
appreciated, the individual select inputs 20 that are required to select the cross- 
points 16 increase the power dissipation and overall size of the crossbar switch 10. 
In addition, the select inputs increase the time that is required to configure the 
crossbar switch 10. 

[0031] Referring now to FIG. 3, a crossbar switch 100 according to the 
present invention is illustrated. The crossbar switch 100 includes inputs 102-1, 102- 
2, 102-3, 102-n that correspond to the n rows of the crossbar switch 100. The 
crossbar switch 100 further includes n decoders 104-1, 104-2, 104-3, 104-n that 
correspond to the n inputs 102. The crossbar switch 100 includes cross-points 110- 
11, 110-12, 110-13, 110-nm that are connected to output columns 114-1, 114-2, 
114-3, 1 14-m of the crossbar switch 100. The rows (1 ... n) of the crossbar switch 
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100 include a serial address input 116 (116-1, 116-2, 116-3 116-n), a shift input 

118 (118-1, 118-2, 118-3, .... 118-n), and a data input 120 (120-1, 120-2, 120-3, .... 
120-n). 

[0032] Referring now to FIGs. 4 and 5, an exemplary row 130 of the 
crosspoints 110 is shown. The cross-points 110 include reset 132, address 136, 
shift 138, data 140, and clock inputs 142. The cross-points 110 include a cross- 
point input 144 and a cross-point output 148. The select logic allows the serial 
address header and the data to enter the crossbar switch 100 in parallel. This 
means that data can flow before the shift sequencing is completed. In effect, the 
configuration time of the crosspoints 100 is reduced nearly to zero. 
[0033] As can be appreciated from FIG. 5, a reset signal 150 deselects all 
output columns 114. An address input 136 selects column one. M-1 shift pulses 
154 select column M. A clock signal 158 shifts data 156 through the array. The 
clock signal 158 can be generated on-chip using known clock recovery circuits. For 
example, U.S. Patent No. 5,963,351 (which is hereby incorporated by reference) 
illustrates a suitable on-chip clock recovery circuit. 

[0034] Referring now to FIG. 6, the logic for the decoders 104 is illustrated. 
The decoder 104 converts the binary address input into a serial address. The binary 

address is an N-bit binary address. For m output columns, the relation 2 N > m must 

be satisfied. The decoder 104 includes an N-bit counter 150 that includes N toggle 
flip-flops (TFF) 152. In a preferred embodiment, the TFFs 152 are divide-by-2 T1FF 
flip-flops which are standard RSFQ gates. The decoder 104 receives a reset input 
156 and a clock input 158. For example, bit 0 of the N-bit binary address is merged 
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at 160 with the clock input 158. Bit 1 of the N-bit binary address is merged at 164 
with an output of the TFF 152-1. Bit 2 of the N-bit binary address is merged at 166 
with an output of the TFF 152-2. An output of the TFF 152-3 is a carry output. In 
use, the counter is initially reset. The N-bit binary address sets an initial state of the 
N-bit counter 150. The counter is clocked and a carry output is produced after 2 N - 
address clock cycles. 

[0035] The crossbar switch 100 according to the present invention is 
preferably implemented using superconductor digital electronics. In a highly 
preferred embodiment, the crossbar switch 100 is implemented using rapid single 
hk flux quantum (RSFQ) logic. The crossbar switch 100 is self-routing and is ideally 

C3 suited for network routers. The crossbar switch is capable of 40 Gbps data rates per 

ru 

jj| line. The crossbar switch provides a reconfiguration time equal to the length of the 

X binary address (i.e. a few clock cycles), a switch size of 256 x 256, 1024 x 1024, or 

L greater and a compact overall size of a few cubic centimeters. The crossbar switch 

ru 

m has relatively low power dissipation of approximately 0.1 to 1 Watt. 

□ [0036] The crossbar switch 100 includes the use of a single serial select input 

per row. The crossbar switch 100 includes select logic that allows the serial address 
and the data to enter the crossbar in parallel, which reduces configuration time in the 
crosspoints to one clock cycle. The logic in the decoder 104 converts the binary 
address input into the serial address. The scalable self-routing superconductor 
switch according to the invention addresses the speed, power dissipation and 
overhead problems that are associated with conventional superconductor switches. 
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[0037] Referring now to FIGs. 7 and 8, a preferred embodiment of the cross- 
point 110 is shown. For purposes of clarity, reference numerals from Fig. 4 have 
been used where appropriate to identify similar elements. The cross-point 110 
includes a nondestructive readout (NDRO) circuit 170, an inverter (NOT) circuit 174, 
a destructive readout (DRO) 176, and a merge circuit 178. Preferably the NDRO, 
DRO and NOT circuits are known and standard RSFQ circuits. 
[0038] The NDRO circuit 170 has a set input connected to the address line 
136, an "in" input that is connected to the clock line 140, and a reset input that is 
connected to the shift line 138. The NDRO circuit 170 has an S ou t output that is 
connected to a clock input of the inverter circuit 174. The NDRO circuit 170 has an 
"out" output that is connected to a clock input of the DRO circuit 176. The Mealy 
state-transition diagram in FIG. 8 illustrates the operation of the NDRO circuit 170. 
[0039] The inverter circuit 174 has an "in" input that is connected to the reset 
line 132 and an "out" output that is connected to an adjacent cross-point. The Mealy 
state-transition diagram in FIG. 8 illustrates the operation of the inverter circuit 174. 
The DRO circuit 176 has an "in" input that is connected to the data line 140 and an 
"out" output that is connected to the merge circuit 178. The Mealy state-transition 
diagram in FIG. 8 illustrates the operation of the DRO circuit 176. Skilled artisans 
will appreciate that the cross-points 110 may be implemented many other ways 
without departing from the spirit and scope of the invention. 

[0040] Referring now to FIG. 9, an NDRO gate 200 with two outputs (out and 
Sout)- The NDRO gate includes a dc voltage bias 202 connected to a resistor 206 
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and inductors 210 and 212. Set, reset, and inputs are connected by Josephson 
junctions 216. 

[0041] Those skilled in the art can now appreciate from the foregoing 
description that the broad teachings of the present invention can be implemented in 
a variety of forms. Therefore, while this invention has been described in connection 
with particular examples thereof, the true scope of the invention should not be so 
limited since other modifications will become apparent to the skilled practitioner upon 
a study of the drawings, the specification and the following claims. 
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